Abstract

It is shown that while the mutual information curves for coded modulation (CM) and bit interleaved coded modulation (BICM) overlap in the case of a single input single output channel, the same is not true in multiple input multiple output (MIMO) channels. A method for mitigating fading in the presence of multiple transmit antennas, named coordinate interleaving (CI), is presented as a generalization of component interleaving for a single transmit antenna. The extent of any advantages of CI over BICM, relative to CM, is analyzed from a mutual information perspective; the analysis is based on an equivalent parallel channel model for CI. Several expressions for mutual information in the presence of CI and multiple transmit and receive antennas are derived. Results show that CI gives higher mutual information compared to that of BICM if proper signal mappings are used. Effects like constellation rotation in the presence of CI are also considered and illustrated; it is shown that constellation rotation can increase the constrained capacity.

Index Terms

Constrained capacity, multiple-input multiple-output channels, mutual information, bit interleaved coded modulation, coordinate interleaving.

I. Introduction

Mitigating the effects of fading on the decoding of forward error correction codes has been studied for single 
transmit antenna configurations, and revolves around some form of interleaving. Traditionally, interleaving was 
performed directly on the coded bits present at the encoder's output. Given that coded modulation (CM) [22] is 
optimal, the recognition of the fact that modulation and coding are no longer necessarily combined when interleaving 
an encoder's output — i.e., prior to mapping the (interleaved) bits to complex (channel) alphabet symbols — has 
naturally led researchers [13] to study capacity limitations of bit interleaved coded modulation (BICM) schemes; 
see [13] and references therein for definition of BICM. In the case of single transmit antenna configurations, the 
conclusion was that mutual information remains essentially the same, provided that Gray mapping is used to map
the interleaved coded bits to the modulator's points. This reinforced a potential advantage of BICM, i.e., flexibility
from the perspective of adaptive coding and modulation schemes. Other aspects of the study in [13] related to the 
cut-off rate — but this concept has been deemed irrelevant with the advance of iterative decoding and concatenated 
coding schemes (e.g., turbo codes). As a result, BICM was deemed desirable, and was employed in standards for 
CDMA [17] and satellite systems that involve one transmit antenna. 

In parallel, an alternative to separating coding from modulation was studied for single transmit antenna con- 
figurations, and named coordinate (or component) interleaving [18]. In that approach, the real and imaginary 
coordinates were interleaved separately, and diversity was derived from the encoder's trellis (minimum Hamming 
distance); there was no difficulty reading the interleaved coordinates, because complex symbols were transmitted 
from only one transmit antenna, and the orthogonal in-phase and quadrature components could be naturally and 
separately detected. The idea of interleaving the real coordinates, or components, of points from multidimensional
constellations embedded in some Cartesian product of the complex field $\mathbb{C}$ was discussed in [26]; therein, an
algebraic number theoretic analysis was pursued to support the conclusion that coordinate interleaving (CI) together
with constellation rotation can increase diversity, separately from any redundancy scheme such as forward error
correction coding. The diversity was quantified in terms of a coordinate-wise Hamming distance. In [19] the authors
restricted the MIMO channel to a two transmit antenna configuration, and relied on constellation rotation along with
an orthogonal space-time diversity scheme [1] to increase diversity by increasing the coordinate-wise Hamming
distance; coordinates of a rotated complex constellation were first interleaved then mapped to orthogonal space-time 
block code matrices for two transmit antennas [1]. By allowing the symbols from the two transmit antennas to be 
easily separable at the receive antenna due to orthogonality, orthogonal space time block codes allow the approach 
in [18] to be extended to two transmit antennas. 

The problem is more complicated when multiple transmit antennas are employed, as coordinates can no longer 
be observed independently from each other due to the superposition of all transmitted complex symbols at any 
receive antenna. A version of BICM for multiple input multiple output (MIMO) channels was examined in [24], 
[25]. 

A more general scheme for CI in the presence of multiple transmit antennas was reported in [27], and summarized 
in Section II below; while the exemplary illustration in [27] was based on a two transmit antenna configuration, it 
can be applied to any number of transmit antennas. In addition, it can be employed simultaneously with a coding 
redundancy scheme (concatenated or not), does not require separating coding from modulation, and does not rely on 
constellation rotation — although rotation is not precluded; it is easily recognized to differ from channel interleaving, 
which would interleave blocks of (all) complex symbols to be transmitted from all transmit antennas during one 
MIMO channel use. This CI scheme offers a means to mitigate fading in MIMO channels, which generalizes the
single antenna concept in a non-obvious manner. The minimum coordinate-wise Hamming distance can be naturally
controlled in the design of the forward error correcting scheme; in essence, after coding and modulation (possibly
combined) the real and imaginary coordinates from the complex symbols of all transmit antennas are interleaved 
over an arbitrary block length, e.g. a frame, and then transmitted as new coordinates in the respective MIMO 
configuration. Further considerations may be possible using the algebraic number theoretic formalism undertaken 
in [26]. Depending on the original complex constellation(s) used on the transmit antennas, the complex channel 
alphabet after CI may change over the wireless channel. Moreover, the coordinate interleaver can be viewed as an 
interleaver in a serially concatenated scheme, thus enabling iterations between decoding and detection in a manner 
similar to turbo decoding; this is also described in a separate work [27]. This paper is not concerned with the 
details of combining coding redundancy and CI, the latter briefly reviewed in Section II; rather, the impact of CI 
on diversity, via the minimum coordinate-wise Hamming distance, motivates one to investigate mutual information 
limits. This is the main purpose of this manuscript, which sets out to determine the extent of any advantages of CI 
over BICM, relative to CM, from a mutual information perspective. It should be noted that in this paper we are 
dealing with capacity limits for finite alphabet constellations (constrained constellations), rather than the generally
used ergodic capacity, which is often associated with the case of continuous alphabets. 

Section II reviews the mechanism by which CI affects diversity in a MIMO system. Section III examines the 
mutual information with finite size alphabets in the single transmit antenna case [13], as well as in several MIMO 
configurations; the numerical results illustrate that the fact that mutual information curves for CM and BICM 
overlap in the case of a single transmit antenna ought not to be taken for granted, as a significant gap occurs in 
MIMO channels. Section IV introduces an equivalent parallel channel model for CI, and derives several expressions 
for mutual information in the presence of CI and multiple transmit antennas. Section V-B examines effects like 
constellation rotation in the presence of CI. Conclusions are drawn in Section VI. 



II. Coordinate Interleaving for MIMO Channels - Diversity Perspective 

Consider a MIMO scenario where the numbers of transmit and receive antennas are $N$ and $M$, respectively. Let
the complex symbols transmitted from the $N$ transmit antennas during $l$ uses of the MIMO channel be represented
as a long vector, whose entries are superscripted by the transmit antenna index and subscripted by the channel use
index. Conditioned on knowledge of the channel state information (CSI), the probability of transmitting a codeword $\mathbf{c}$
and deciding in favor of a different codeword $\mathbf{e}$ at the maximum likelihood decoder is bounded as below

$$\Pr\{\mathbf{c} \to \mathbf{e} \mid \alpha_{i,j}[k],\ i = 1, \dots, N,\ j = 1, \dots, M,\ k = 0, \dots, l-1\} \le \exp\left(-d^2(\mathbf{e}, \mathbf{c})\, E_s / 4N_0\right),$$

where $\alpha_{i,j}[\cdot]$ are the channel coefficients between transmit antenna $i$ and receive antenna $j$, with $E\{|\alpha_{i,j}|^2\} = 1$,
$\forall i, j$. The key parameter is clearly $d^2(\mathbf{e}, \mathbf{c})$, which depends on the two codewords through the matrices

$$C[k] = (\mathbf{c}_k - \mathbf{e}_k)(\mathbf{c}_k - \mathbf{e}_k)^{*},$$

where $\mathbf{c}_k$ and $\mathbf{e}_k$ collect the symbols of the two codewords at channel use $k$, and the superscript * indicates complex conjugation (applied here together with transposition, so that $C[k]$ is an $N \times N$ matrix). Since $C[k]$ is Hermitian, it admits the singular value
decomposition (SVD)

$$C[k] = V[k]\, D[k]\, V^{H}[k] \qquad (5)$$

where the superscript 'H' indicates a Hermitian operation (complex conjugated transposition). Denote by $D_{ii}[k]$,
$1 \le i \le N$, the diagonal elements of $D[k]$, which is diagonal per the SVD. The vector $\Omega_j[k] = [\alpha_{1,j}[k], \dots, \alpha_{N,j}[k]]$ of relevant
channel coefficients (to receive antenna $j$) is transformed by virtue of the SVD into

$$\Omega_j[k]\, V[k]. \qquad (6)$$

Because $V[k]$ is unitary, the independent complex Gaussian random variables (r.v.s) $\alpha_{1,j}[k], \dots, \alpha_{N,j}[k]$ are transformed
into a new set of $N$ i.i.d. r.v.s. In other words, there exists an equivalent set of channels $\beta_{1,j}[k], \dots, \beta_{N,j}[k]$
that characterizes the transmission. Therefore, for each channel use $k$, and each receive antenna $j$,

$$\Omega_j[k]\, C[k]\, \Omega_j^{H}[k] = \sum_{i=1}^{N} |\beta_{i,j}[k]|^2\, D_{ii}[k].$$
By definition, $C[k]$ has rank 1 (that is, if the set $c_k^{(1)}, \dots, c_k^{(N)}$ is different from $e_k^{(1)}, \dots, e_k^{(N)}$); thereby,
exactly one value among $D_{11}[k], \dots, D_{NN}[k]$, be it $D_{i_0 i_0}[k]$, is nonzero. The nonzero value must necessarily equal
the trace of $C[k]$, which in turn equals

$$\mathrm{tr}\, C[k] = \sum_{i=1}^{N} \left|c_k^{(i)} - e_k^{(i)}\right|^2.$$

Consequently, each term of the key parameter $d^2(\mathbf{e}, \mathbf{c})$ reduces to

$$\Omega_j[k]\, C[k]\, \Omega_j^{H}[k] = |\beta_{i_0,j}[k]|^2\, D_{i_0 i_0}[k] = |\beta_{i_0,j}[k]|^2 \sum_{i=1}^{N} \left|c_k^{(i)} - e_k^{(i)}\right|^2. \qquad (9)$$

Clearly, there exists an equivalent set of independent complex Gaussian channels derived from the original set
of independent complex Gaussian channels, and exactly one of them affects all of the coordinates of the multidimensional
point $(c_k^{(1)}, c_k^{(2)}, \dots, c_k^{(N)})$ transmitted during channel use $k$; this means that if the nonzero equivalent
channel $\beta_{i_0,j}[k]$ fades, it will affect all $2N$ coordinates transmitted during the $k$th channel use. The essence of CI
is summarized by

Theorem 1: There exists an equivalent set of independent complex Gaussian channels derived from $\alpha_{1,j}[k], \dots, \alpha_{N,j}[k]$,
such that exactly one of them affects all (real/imaginary) coordinates of a transmitted multidimensional point
$(c_k^{(1)}, \dots, c_k^{(N)}) \in \mathbb{R}^{2N}$, via $\Omega_j[k]\, C[k]\, \Omega_j^{H}[k] = |\beta_{i_0,j}[k]|^2 \sum_{i=1}^{N} |c_k^{(i)} - e_k^{(i)}|^2$.
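The rank-one argument behind Theorem 1 can be checked numerically for a single channel use. The sketch below is only an illustration under stated assumptions: the function name `equivalent_channel_check`, the 4QAM symbols, and the random draws are hypothetical choices, and `numpy.linalg.eigh` is used in place of an SVD, which yields the same factorization $V D V^H$ for a Hermitian positive semidefinite matrix.

```python
import numpy as np

def equivalent_channel_check(N=3, seed=0):
    """Check numerically that one equivalent channel carries the whole quadratic form."""
    rng = np.random.default_rng(seed)
    qam4 = np.array([1 + 1j, 1 - 1j, -1 + 1j, -1 - 1j]) / np.sqrt(2)
    c = rng.choice(qam4, size=N)            # transmitted symbols at one channel use
    e = c.copy()
    e[0] = -c[0]                            # force the competing vector to differ
    diff = (c - e).reshape(-1, 1)
    C = diff @ diff.conj().T                # rank-one Hermitian matrix C[k]
    D, V = np.linalg.eigh(C)                # C = V diag(D) V^H, eigenvalues ascending
    # Row vector of i.i.d. unit-variance complex Gaussian channel coefficients.
    omega = (rng.standard_normal(N) + 1j * rng.standard_normal(N)) / np.sqrt(2)
    beta = omega @ V                        # equivalent channels, as in eq. (6)
    i0 = int(np.argmax(D))                  # index of the single nonzero eigenvalue
    return {
        "quad_form": float((omega @ C @ omega.conj()).real),
        "weighted_sum": float(np.sum(np.abs(beta) ** 2 * D)),
        "single_term": float(np.abs(beta[i0]) ** 2 * np.trace(C).real),
        "num_nonzero": int(np.sum(np.abs(D) > 1e-9)),
    }

result = equivalent_channel_check()
```

All three quantities in `result` should agree up to rounding, and `num_nonzero` should be 1, which is the statement that a single equivalent fade scales every coordinate sent in that channel use.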

Theorem 1 implies that, when using multiple transmit antennas and CM possibly over non-binary fields, with or 
without puncturing, it is desirable to: 

1) Interleave the coordinates of transmitted multidimensional constellation points in order to enable, and render 
meaningful, a Hamming distance with respect to coordinates — rather than complex symbols; and, 

2) Design codes for multiple transmit antennas that can maximize the minimum coordinate-wise (as opposed 
to complex-symbol-wise per current state of the art [2]) Hamming distance between codewords, in order to 
derive the most diversity from CI.
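The two distance notions contrasted in the list above can be compared on a toy pair of symbol sequences. This is a minimal sketch; the function names `symbol_hamming` and `coordinate_hamming` and the example sequences are hypothetical, chosen only to show that the coordinate-wise count can exceed the complex-symbol-wise count.

```python
import numpy as np

def symbol_hamming(c, e):
    """Number of positions where the complex symbols differ."""
    c, e = np.asarray(c, dtype=complex), np.asarray(e, dtype=complex)
    return int(np.sum(~np.isclose(c, e)))

def coordinate_hamming(c, e):
    """Number of real coordinates (in-phase or quadrature) that differ."""
    c, e = np.asarray(c, dtype=complex), np.asarray(e, dtype=complex)
    re_diff = ~np.isclose(c.real, e.real)
    im_diff = ~np.isclose(c.imag, e.imag)
    return int(np.sum(re_diff) + np.sum(im_diff))

c = [1 + 1j, 1 - 1j, -1 + 1j]
e = [1 + 1j, -1 + 1j, -1 - 1j]
d_symbol = symbol_hamming(c, e)          # 2 symbols differ
d_coordinate = coordinate_hamming(c, e)  # 3 coordinates differ
```

With ideal CI each differing coordinate would see its own fade, so under the assumptions of this sketch it is `d_coordinate`, not `d_symbol`, that a code design would try to maximize.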

Note that, in general, CI is different from BICM [13] (see also Example 1), and does not preclude (or destroy)
the concept of CM (via signal-space coding); this is so because CI operates on the (real-valued) coordinates of
the complex values from the complex modulator alphabet, rather than operating on the coded bits, prior to the
modulator [13, Fig. 1].

It is possible to design space-time codes for CI that exhibit diversity exceeding that of existing codes; this, 
however, is the scope of a separate work [27]. The sequel is concerned with characterizing CI from a mutual 
information perspective. 



III. Mutual information limits for CM and BICM: Is BICM Efficient in MIMO Channels? 

If the model in [13] is used to characterize mutual information limits for CM and BICM schemes that involve
$N > 1$ transmit antennas (only the scenario $N = 1$ was discussed in [13]) the curves in Fig. 1 are obtained; the
numerical results were computed by an extension to MIMO channels of the analysis (and numerical approach) 
from [13]. The curves in Fig. 1 indicate a fundamental difference between the information rate limits in scenarios 
that involve finite channel alphabets, and more than one transmit/receive antennas — even for the Gray mapping 
case, which in [13] is conjectured to maximize the mutual information of bit-interleaved schemes. (It is possible 
to view CM as a concatenation between a code and a mapper, provided that the mapper is derived from some set 
partitioning labelling.) While it can be rigorously proven that CI does not change the ergodic channel capacity, the 
model in [13] predicts a gap between mutual information curves for BICM relative to pure CM (due, in part, to the 
finite size channel alphabet; see, however, the footnote in [13, p. 931]). The relevant mutual information curves are 
illustrated in Fig. 1; with N = 2 transmit antennas and M = 1 receive antenna, the gap is significant, but narrows 
when M = 2. 

Note that in CI schemes the role of iterating between decoding and detection is to circumvent maximum likelihood 
detection; the presence of an interleaver does not always preclude maximum likelihood — e.g., a symbol interleaver 
in a CM scheme does not. It should be noted that if iterative demodulation and decoding is used for CI (same as 
for BICM), it is possible to approach the i.i.d. CM capacity limit, since iterations improve the quality of the soft 
information being exchanged between the demodulator and decoder. However, this paper attempts to quantify only 
the case of non-iterative demodulation/decoding. 

To further illustrate the undesirable effect of BICM, Fig. 2 shows the relevant mutual information curves when 
N = M = 2, and 16QAM is used on the individual transmit antennas. 

The two figures above show that BICM suffers a substantial loss in mutual information compared to CM in
MIMO channels. This, together with the diversity advantage of CI shown in the previous section, motivates an
investigation of CI from a mutual information perspective. Results will show that CI outperforms BICM in terms of
mutual information and narrows the gap relative to CM. The following section derives the mutual information
for a CI scheme in MIMO channels.
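To make the CM versus BICM comparison concrete, the following is a rough Monte Carlo sketch of both constrained mutual informations for 4QAM over an i.i.d. Rayleigh MIMO channel with perfect receiver knowledge of the fading coefficients. It is not the BCJR-based tool used for the figures; the function name `cm_and_bicm_mi`, the SNR convention (unit symbol energy per transmit antenna, noise variance $N_0 = 10^{-\mathrm{SNR_{dB}}/10}$), and the sample sizes are assumptions made only for illustration.

```python
import itertools
import numpy as np

def _logsumexp_rows(a):
    """Row-wise log(sum(exp(a))), stable when some entries are -inf."""
    m = a.max(axis=1, keepdims=True)
    return (m + np.log(np.sum(np.exp(a - m), axis=1, keepdims=True)))[:, 0]

def cm_and_bicm_mi(N=2, M=1, snr_db=6.0, trials=4000, seed=0):
    """Monte Carlo estimates (bits per channel use) of CM and Gray-mapped BICM MI."""
    rng = np.random.default_rng(seed)
    n0 = 10.0 ** (-snr_db / 10.0)
    # Gray-labelled 4QAM: one bit sets the sign of each real coordinate.
    bits = np.array(list(itertools.product([0, 1], repeat=2 * N)))   # (P, 2N)
    signs = 1 - 2 * bits
    X = (signs[:, 0::2] + 1j * signs[:, 1::2]) / np.sqrt(2)          # (P, N)
    P = len(X)
    H = (rng.standard_normal((trials, M, N))
         + 1j * rng.standard_normal((trials, M, N))) / np.sqrt(2)
    idx = rng.integers(P, size=trials)                               # transmitted points
    noise = np.sqrt(n0 / 2) * (rng.standard_normal((trials, M))
                               + 1j * rng.standard_normal((trials, M)))
    HX = np.einsum('tmn,pn->tpm', H, X)                              # (T, P, M)
    rows = np.arange(trials)
    y = HX[rows, idx] + noise
    logp = -np.sum(np.abs(y[:, None, :] - HX) ** 2, axis=2) / n0     # log p(y|x) + const
    lse_all = _logsumexp_rows(logp)
    cm = np.log2(P) - np.mean(lse_all - logp[rows, idx]) / np.log(2)
    bicm = 0.0
    for i in range(2 * N):
        # Candidates that agree with the transmitted point in bit position i.
        match = bits[:, i][None, :] == bits[idx, i][:, None]
        lse_match = _logsumexp_rows(np.where(match, logp, -np.inf))
        bicm += 1.0 - np.mean(lse_all - lse_match) / np.log(2)
    return cm, bicm

cm_2x1, bicm_2x1 = cm_and_bicm_mi(N=2, M=1, snr_db=6.0)
```

Because both estimates reuse the same channel and noise samples, their difference is less noisy than either value alone; the BICM estimate is expected to fall below the CM estimate for the two-transmit, one-receive configuration, in line with the gap described above.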




[Figure 1 plot. Legend: CM, 2x2, 4QAM; BICM, 2x2, 4QAM, Gray mapping; BICM, 2x2, 4QAM, natural mapping; CM, 2x1, 4QAM; CM, 1x1, 16QAM; BICM, 2x1, 4QAM, Gray mapping; BICM, 2x1, 4QAM, natural mapping.]

Fig. 1. Mutual information for CM vs. BICM [13], with finite 4PSK modulation alphabets, two transmit antennas and either 1 or 2 receive
antennas, and memoryless fading channels. The mutual information for CM was computed based on the model and approach from [13],
using the computational tool described in [14], based on the BCJR algorithm.

[Figure 2 plot. Legend: BICM, 16QAM, Gray mapping, 2x2; CM, 16QAM, 2x2.]

Fig. 2. Mutual information for CM vs. BICM [13], with finite 16QAM modulation alphabets, two transmit antennas and 2 receive antennas,
in memoryless fading channels; note increased gap. The mutual information for CM was computed based on the model and approach from
[13], using the computational tool described in [14], based on the BCJR algorithm.
IV. Mutual Information for Coordinate Interleaved Schemes in MIMO Rayleigh Fading Channels

As a result of coding and modulation, $2N$ (in-phase and quadrature) coordinates are generated during any channel
use for transmission from the transmit antennas; coordinates are collected from all MIMO channel uses (over a
long enough frame), and interleaved in such a manner as to ensure that any $2N$ coordinates which, in the absence
of CI, would be transmitted from the $N$ transmit antennas during the same MIMO channel use are transmitted
(as in-phase and quadrature components) from different antennas, during different MIMO channel uses; i.e., each
of the $2N$ coordinates will experience independent fading relative to each other [but the same fading as some other
$2N - 1$ coordinates along with which it is eventually transmitted during some channel use, per eq. (9)]. Specifically,
let the number of MIMO channel uses in one coordinate interleaved frame be $l$; the coordinate interleaver size is
then $2Nl$. Denote by $\mathbf{x}_{k'}$ the $2N$ dimensional symbol formed by the $2N$ coordinates that are to be transmitted,
post CI, from the $N$ transmit antennas during the $k'$-th MIMO channel use, $k' = 0, \dots, l-1$:

$$\mathbf{x}_{k'} \in \mathcal{X} = \left\{\mathbf{x} \mid \mathbf{x} = [x_0, \dots, x_{2N-1}]^T \simeq [x^{(1)}, \dots, x^{(N)}]^T\right\}, \qquad (10)$$

where $\simeq$ denotes a known isomorphism between real and complex vectors, and

$$x^{(m+1)} = x_{2m}\, j + x_{2m+1} \in \mathcal{M}, \quad m = 0, \dots, N-1, \qquad (11)$$

is a complex symbol, from some constellation $\mathcal{M}$ (see below), to be sent from the $(m+1)$-st transmit antenna; the
constellation $\mathcal{M}$ will arise as the Cartesian product of the in-phase and quadrature alphabets post CI. (The resulting
constellation mapping after CI, $\mathcal{M}$, will be the same as the original mapping for square QAM modulation,
and different for general PSK modulation.) Further, denote by $\{c_k\}_{k=0}^{2Nl-1}$, respectively $\{\kappa_k\}_{k=0}^{2Nl-1}$, the coordinates
before and after interleaving. A coordinate $2N$-tuple $\{\kappa_k\}_{k=2Np}^{2N(p+1)-1}$, $p \in \mathbb{N}$, will be said to form an $N$-antenna
label, because they would all form the $2N$ coordinates to be sent from the $N$ antennas during the same channel
use; more specifically, the coordinates whose index is congruent modulo 2 with 0, respectively 1, will form the
quadrature and in-phase components.

In order to model the CI operation, let $c_k$ be permuted to $\kappa_{\pi_0(k) 2N + \pi_1(k)}$, i.e., the coordinate interleaver permutation
will be $\pi : k \mapsto (\pi_0(k), \pi_1(k))$, $\pi_1(k) = 0, 1, \dots, 2N-1$, $\pi_0(k) = 0, 1, \dots, l-1$. That is, $c_k$ is transmitted
during the $\pi_0(k)$-th MIMO channel use, as the $\pi_1(k)$-th component of the $N$-antenna label of $\mathbf{x}_{k'}|_{k' = \pi_0(k)}$. The
receiver has to detect the $i$-th component of $\mathbf{x}_{k'}$ and unpermute it to obtain $c_{\pi^{-1}(k', i)}$. Note that the component
pairs $x_{2m}, x_{2m+1}$ are affected by the same fade, namely the channel fading coefficient on the link between transmit
antenna $m + 1$ and the relevant receive antenna.
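The index bookkeeping of the permutation $\pi : k \mapsto (\pi_0(k), \pi_1(k))$ can be sketched in a few lines. The helper names below are hypothetical, and a uniformly random permutation is used purely for illustration: it does not by itself guarantee that coordinates from one pre-interleaving channel use land in different channel uses, which a practical interleaver design would have to enforce.

```python
import numpy as np

def make_interleaver(num_uses, N, seed=0):
    """perm[k] = pi0(k) * 2N + pi1(k): position of coordinate c_k after CI."""
    rng = np.random.default_rng(seed)
    return rng.permutation(2 * N * num_uses)

def interleave(c, perm):
    kappa = np.empty_like(c)
    kappa[perm] = c                      # kappa[pi0(k)*2N + pi1(k)] = c[k]
    return kappa

def deinterleave(kappa, perm):
    return kappa[perm]                   # recovers c[k] from its post-CI position

def slot_of(k, perm, N):
    """Return (pi0(k), pi1(k)): channel use and position in the N-antenna label."""
    pi0, pi1 = divmod(int(perm[k]), 2 * N)
    return pi0, pi1

def antenna_symbols(kappa, N):
    """Map each 2N-tuple to N complex symbols: even index quadrature, odd in-phase."""
    labels = kappa.reshape(-1, 2 * N)
    return labels[:, 1::2] + 1j * labels[:, 0::2]

N, num_uses = 2, 5
c = np.arange(2 * N * num_uses, dtype=float)   # stand-in coordinate values
perm = make_interleaver(num_uses, N)
kappa = interleave(c, perm)
X = antenna_symbols(kappa, N)                  # shape (num_uses, N)
```

Here row `X[p]` holds the symbols sent during channel use `p`, and `deinterleave(kappa, perm)` returns the original coordinate order at the receiver.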

As long as the coordinate interleaver has sufficient depth (to ensure exposure to independent fades), the index $\pi_0(k)$
of the MIMO channel use that conveys, after interleaving, a (pre-interleaving) coordinate $c_k$ becomes irrelevant from
the perspective of computing the mutual information; further, coordinate $c_k$ appears as a random element $\pi_1(k)$ of
some $N$-antenna label (a $2N$-dimensional point), whose components are detected individually, after having been
affected by a common fading coefficient (see eq. (9)). The random position in an $N$-antenna label can be modeled
by a switch $S_\ell$, and the equivalent parallel model is shown in Fig. 3.

The complex constellation $\mathcal{M}$ mentioned above arises as the Cartesian product of the quadrature and in-phase
alphabets post component interleaving; in order to further define the formalism, assume for simplicity that prior
to CI the modulator employs the same complex constellation $\mathcal{Q}$ on each of the $N$ transmit antennas. Let $\mathcal{Q}_I$ and
$\mathcal{Q}_Q$ be, respectively, the alphabets pertaining to the in-phase and quadrature coordinates of points from $\mathcal{Q}$ prior to
CI. Since CI mixes the in-phase and quadrature coordinates, each coordinate post interleaving will belong to an
alphabet $\mathcal{Q}_U$ that is the union of the alphabets $\mathcal{Q}_I$ and $\mathcal{Q}_Q$, $\mathcal{Q}_U \stackrel{\mathrm{def}}{=} \mathcal{Q}_I \cup \mathcal{Q}_Q$. Therefore, let

$$q_U = |\mathcal{Q}_U| = |\mathcal{Q}_I \cup \mathcal{Q}_Q| \qquad (12)$$

be the cardinality of both the in-phase and the quadrature alphabets post component interleaving, each coordinate
alphabet being, in turn, the union of the in-phase and quadrature alphabets prior to component interleaving. Then
the effective (post-interleaving) complex constellation associated with any transmit antenna becomes

$$\mathcal{M} = (\mathcal{Q}_I \cup \mathcal{Q}_Q) \times (\mathcal{Q}_I \cup \mathcal{Q}_Q), \qquad (13)$$
[Figure 3 diagram: $2N$ parallel $q_U$-ary input channels, Ch. 0 through Ch. $2N-1$, carrying the quadrature and in-phase coordinates $\kappa_0, \kappa_1, \dots, \kappa_{2N-2}, \kappa_{2N-1}$ of antennas 1 through $N$. The output of the $i$-th channel is $y_{s,i} = h(l_0)\,x(l_0) + \sum_{l=1,\, l \ne l_0}^{N} h(l)\,x(l) + n$, with $l_0$ determined by $\lfloor i/2 \rfloor$; $i$ even implies $\Im(x(l_0)) = \kappa_i$, and $i$ odd implies $\Re(x(l_0)) = \kappa_i$.]

Fig. 3. Equivalent parallel channel model for computing mutual information in the presence of CI, assuming ideal (coordinate) interleaving and
$N$ transmit antennas; the depicted channel conveys one $q_U$-ary r.v., carried in a random position of an $N$-antenna label. The total mutual
information with CI will be $2N$ times the capacity of this channel; $y_{s,i}$ is the observation of the $i$-th channel at the $s$-th receive antenna. $h(l)$
represents the channel coefficient from the $l$-th transmit antenna to the $s$-th receive antenna; $x(l)$ is the complex value, from some complex
alphabet $\mathcal{M}$, transmitted from the $l$-th transmit antenna. The complex constellation $\mathcal{M}$ arises as the Cartesian product of the quadrature
and in-phase alphabets post component interleaving; $q_U$ is the cardinality of both the in-phase and the quadrature alphabets post component
interleaving, each coordinate alphabet being, in turn, the union of the in-phase and quadrature alphabets prior to component interleaving.



and the MIMO constellation post CI, $\mathcal{X}$, is the $N$-fold Cartesian product of $\mathcal{M}$ with itself,

$$\mathcal{X} = \mathcal{M}^N. \qquad (14)$$
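The alphabets in eqs. (12) through (14) are easy to enumerate for a given constellation. The sketch below is an illustration only; the function names and the rounding tolerance used to merge numerically equal coordinates are assumptions, not part of the original formalism.

```python
import numpy as np

def post_ci_alphabets(Q, decimals=9):
    """Return (Q_I, Q_Q, Q_U, M) for a complex constellation Q, per eqs. (12)-(13)."""
    Q = np.asarray(Q, dtype=complex)
    # Round so that coordinates equal up to floating-point error are merged;
    # adding 0.0 turns any -0.0 into +0.0.
    QI = np.unique(np.round(Q.real, decimals) + 0.0)
    QQ = np.unique(np.round(Q.imag, decimals) + 0.0)
    QU = np.union1d(QI, QQ)
    M = np.add.outer(QU, 1j * QU).ravel()     # Cartesian product Q_U x Q_U
    return QI, QQ, QU, M

def mimo_constellation_size(Q, N):
    """|X| = |M|^N, per eq. (14)."""
    return len(post_ci_alphabets(Q)[3]) ** N

qam4 = np.array([1 + 1j, 1 - 1j, -1 + 1j, -1 - 1j]) / np.sqrt(2)
psk8 = np.exp(2j * np.pi * np.arange(8) / 8)
rotated = qam4 * np.exp(1j * np.deg2rad(20.0))

sizes = {name: (len(post_ci_alphabets(Q)[2]), len(post_ci_alphabets(Q)[3]))
         for name, Q in [("4QAM", qam4), ("8PSK", psk8), ("rotated 4QAM", rotated)]}
```

For unrotated 4QAM this gives $q_U = 2$ and $\mathcal{M} = \mathcal{Q}$, whereas 8PSK and the rotated 4QAM example enlarge the per-antenna constellation after CI, matching the remark that the mapping is unchanged for square QAM and changes for general PSK.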

Finally, the operation of mapping real (post-interleaving) coordinates to multidimensional points is defined
naturally, in that a coordinate simply becomes a component of some multidimensional point $\mathbf{x} \in \mathcal{X}$, and its
label; unlike in BICM [13], where Gray mapping has been conjectured to be optimal, the mapping of coordinates
to $N$-antenna labels is unambiguous (no multiple choices).

The problem then becomes calculating the mutual information between pre-interleaving $2N$-tuples $[c_0, c_1, \dots, c_{2N-1}]^T$
and the receiver observations at the $2N$ different MIMO channel uses^2 that convey, post component interleaving,
those coordinates; clearly this model involves buffering. If CI is such as to ensure that each uninterleaved coordinate
is sent during independent MIMO channel uses, then the desired mutual information is $2N$ times the
mutual information of an abstract channel conveying a single, $q_U$-ary, post-interleaving coordinate, i.e. between an
interleaved coordinate and the observations, by the $M$ receive antennas, of the multidimensional point from $\mathcal{M}^N$
that carries the corresponding interleaved coordinate. The assumption of ideal interleaving reduces, thereby, the
relevant problem to the random switch model in Fig. 3. The auxiliary switch $S_\ell$ will be eliminated by averaging
over its possible realizations from $\{0, \dots, 2N-1\}$; this will require one to compute the mutual information of the
$i$-th channel, i.e. between $\kappa_i$ and $\mathbf{y}_i = [y_{1,i}, \dots, y_{M,i}]^T$ in Fig. 3, where $y_{s,i}$ is the complex observation by receive
antenna $s$ of the output of the $i$-th channel, $i$ is a realization of the r.v. $S_\ell$, and the $\kappa_i$-s are i.i.d.^3 Therefore, the
effect of CI is modeled by the operation of the switch $S_\ell$, and by the assumption of independence between MIMO
channel uses.

Example 1: By visual inspection, one easily and rigorously verifies that when $\mathcal{Q}$ is an unrotated 4QAM constellation,
$\mathcal{Q}_I = \mathcal{Q}_Q = \mathcal{Q}_U$, $\mathcal{M} = \mathcal{Q}$, $q_U = 2$, i.e. the inputs to the parallel channels in Fig. 3 are binary r.v.s,
like in BICM; on the other hand, the BICM channel input is an $m$-tuple with $m = N \log_2 |\mathcal{Q}| = 2N$. Thereby,
Fig. 3 reduces to the equivalent BICM model from [13, Fig. 3] that corresponds to using a 4QAM constellation
on each transmit antenna (although [13] restricts itself to a single transmit antenna, the BICM model from [13] is
directly generalized to $N > 1$ with the additional averaging in the form of marginal probabilities, as discussed in
Section IV-B). Consequently, BICM and CI are equivalent in the 4PSK case, for any number of transmit antennas
$N$. This proves in part (i.e. for the 4PSK scenario) Biglieri et al.'s conjecture made in [13] that Gray mapping is
optimal for BICM.

^2 Per assumption at beginning of Section IV.

^3 The i.i.d. assumption for the $\kappa_i$-s does not hold, in general, for constellations that increase their size after CI.

In the context defined above, the sequel addresses the scenario when the receiver has perfect or partial channel
state information, or no channel state information, and, of course, the information necessary to unpermute
the coordinates; the latter translates into information about the switch $S_\ell$. In general, as it will turn out below,
computing the desired mutual information in the presence or absence of channel state information reduces to
computing $I(\kappa; \mathbf{y}, \theta, S_\ell = i)$ or $I(\kappa; \mathbf{y}, S_\ell = i)$, where $\theta$ represents a (generalized) channel state parameter, i.e.
a vector parameter on which the transition between channel input and output depends. The mutual information
$I(\kappa; \mathbf{y}, \theta, S_\ell = i)$ will be derived below in order to reveal the computational approach for numerically computing
it; the modifications necessary to obtain $I(\kappa; \mathbf{y}, S_\ell = i)$ will be straightforward.

A. Problem setting and notations 

Lower case symbols will denote scalar quantities (real or complex), boldface symbols will denote vectors, and 
underlined boldface symbols will denote sequences of scalars or of vectors. 

In general, the channel input at channel use $k$, denoted generically $\mathbf{x}_k$, can be viewed as a point (vector) from
some real $N_i$ dimensional space $\mathbb{R}^{N_i}$ or, for even $N_i \ge 2$, from a complex $N_i/2$ dimensional space $\mathbb{C}^{N_i/2}$; the
complex vector entries can be further limited to some complex (i.e., real two dimensional) alphabet $\mathcal{Q}$, in which
case the overall complex $N_i/2$ dimensional alphabet, denoted $\mathcal{X}$, has finite cardinality $|\mathcal{X}| \stackrel{\mathrm{def}}{=} M_q$, often verifying
$M_q = 2^m$. In the case of CM for MIMO channels the channel input relevant to computing the mutual information
is a complex $N$ dimensional point (vector), whereby $N_i = 2N$ (see [13], albeit [13] only addressed the single
antenna scenario $N = M = 1$); for BICM the relevant channel input is a bit $m$-tuple [13], i.e. a point from $\mathbb{R}^m$,
and a parallel channel model [13, Fig. 3] reduces it to a single bit, i.e. a point from $\{0, 1\} \subset \mathbb{R}$, whereby $N_i = 1$.
In the CI case, the relevant channel input is a point in $\mathbb{R}^{2N}$, and the parallel channel model from Fig. 3 reduces it
to a single dimensional real point from $\mathcal{Q}_U \stackrel{\mathrm{def}}{=} \mathcal{Q}_I \cup \mathcal{Q}_Q$.

The output $\mathbf{y}$ of the channel, generally from some space $\mathbb{C}^{N_o}$, can be viewed for the MIMO problem at hand
as a point (vector) from a complex $M$ dimensional space, with $M$ being the number of receive antennas, whereby
$N_o = M$; this is true both for the aggregate MIMO channel and for any one of the parallel channels in Fig. 3.

In order to correctly identify the state vector as a point from some complex space $\mathbb{C}^{N_s}$, consider the $i$-th parallel
channel in Fig. 3, and note that the channel output is a collection of $M$ receive antenna observables; even when
the channel from any transmit antenna to any receive antenna is memoryless, the observable at receive antenna
$s$ depends not only on the current channel coefficients and on coordinate $\kappa_i$ (during the current MIMO channel
use), but also on the remaining coordinates (other than $\kappa_i$) arriving at the receive antenna $s$ on the other channels
during the current MIMO channel use, and acting as interference. Thereby the state vector for the $i$-th subchannel
is

$$\theta_i = [\underbrace{h_{1,1}, \dots, h_{1,M}, \dots, h_{N,1}, \dots, h_{N,M}}_{\stackrel{\mathrm{def}}{=}\, \theta_{\mathrm{ch}}},\ \underbrace{\kappa_0, \dots, \kappa_{i-1}, \kappa_{i+1}, \dots, \kappa_{2N-1}}_{\stackrel{\mathrm{def}}{=}\, \theta_{\mathrm{MI},i}}], \qquad (15)$$

whereby $N_s = NM + 2N - 1$, and where $h_{l,s} \in \mathbb{C}$ are complex Gaussian r.v.s of unit variance and zero mean,
modeling independent flat fading coefficients from transmit antenna $l$ to receive antenna $s$.

Thereby, when transmitting (and detecting!) a $q_U$-ary r.v. $\kappa$ on a random position $i$ in an $N$-antenna label (see
Fig. 3), the channel state can be viewed as a random parameter with (random) realizations $\theta_i$ that depend on the
realizations of $S_\ell$.

B. Coherent vs. non-coherent detection: $p_\theta(\mathbf{y}|\mathbf{x})$ vs. $p(\mathbf{y}|\mathbf{x})$

The important implication of the above form of the state vector is that even when the receiver knows the MIMO
channel perfectly, lack of information about any interfering coordinates renders the problem noncoherent (partial
channel information, or no channel state information)^4. The following discussion serves to meaningfully characterize
the differences between coherent, noncoherent, and partially coherent scenarios.

The assumption of coherence vs. noncoherence essentially translates into whether the relevant pdf is conditioned
on the channel state, or averaged over the channel state. One characteristic of the MIMO channel is that the part
of the channel state vector that depends on the multiple transmit antenna interference is always unknown (except
possibly when talking about pilot symbols), and therefore the detection is always at least partially non-coherent; it
will be necessary to average the pdf over the unknown part of the channel (even when the actual fading coefficients
are known to the receiver), and this task will be naturally achieved by computing marginal probabilities. Note that
the non-coherent aspect is responsible for the gap between CM and BICM in MIMO channels. Formally, this can
be briefly described as follows.

In the coherent case the channel is characterized by the family of transition probability density functions 



6 eC^' ,x eR^\y eC^'} (16) 



where the vector parameter represents the channel state. The sequence of channel state vectors from a set of 
channel uses is denoted 6. The essential assumption is that 6 is independent of the channel input x; e.g., in any 
of the parallel channels in Fig. 3 ^mi,? does not depend on Ki — clearly 6ch is independent of Kj, which makes Oi 
independent of Ki. In addition it is assumed that, conditioned on 6, the channel is memoryless, i.e. 

pe(.y\^) = IlpeSyk\^k)- (iv) 

k 

Note that in ISI channels the state 6 depends on the input sequence, and therefore, given 6, y does depend on the 
input sequence (i.e. yj, depends not only on Xk but also on x^^i, etc.). Moreover, is assumed to be stationary and 
have finite memory. OFDM systems do verify the above assumptions, as the channel experienced by the encoder 
is flat (albeit correlated). 

In the general noncoherent case, lack of knowledge about $\boldsymbol{\theta}$ prevents the channel transitions from being memoryless. Nevertheless, under ideal interleaving, and for all finite index sets wherein interleaving renders the channel realizations independent, $E_{\boldsymbol{\theta}}\{p_{\boldsymbol{\theta}}(\mathbf{y} \mid \mathbf{x})\} = E_{\boldsymbol{\theta}}\{\prod_k p_{\theta_k}(y_k \mid x_k)\}$ by (17); because of the independence between the elements of the sequence $\boldsymbol{\theta}$, the average of the product equals the product of averages, and one can define

$$p(\mathbf{y} \mid \mathbf{x}) = E_{\boldsymbol{\theta}}\{p_{\boldsymbol{\theta}}(\mathbf{y} \mid \mathbf{x})\} = \prod_k p(y_k \mid x_k),$$

where a new average transition pdf is defined as

$$p(y \mid x) = E_{\theta}\{p_{\theta}(y \mid x)\}. \tag{18}$$

Using $p(y \mid x)$ instead of $p_{\theta}(y \mid x)$ allows metrics, decision rules, and expressions for average mutual information in the coherent case to be carried over to noncoherent scenarios.
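The averaging in (18) can be illustrated numerically. The following is a minimal sketch, assuming a scalar flat Rayleigh channel $y = hx + n$ with $h \sim \mathcal{CN}(0,1)$ playing the role of $\theta$ and $n \sim \mathcal{CN}(0, N_0)$; this model, the function names, and the parameter values are illustrative assumptions rather than part of the paper. Under these assumptions the averaged pdf has the closed form $\mathcal{CN}(0, |x|^2 + N_0)$, which gives a check on the Monte Carlo average.

```python
import numpy as np

def coherent_pdf(y, x, h, n0):
    """p_theta(y|x) for the assumed model y = h*x + n, n ~ CN(0, n0), theta = h."""
    return np.exp(-np.abs(y - h * x) ** 2 / n0) / (np.pi * n0)

def averaged_pdf_mc(y, x, n0, num_draws=200_000, seed=0):
    """Monte Carlo version of (18): average p_theta(y|x) over draws of theta."""
    rng = np.random.default_rng(seed)
    h = (rng.standard_normal(num_draws)
         + 1j * rng.standard_normal(num_draws)) / np.sqrt(2)
    return coherent_pdf(y, x, h, n0).mean()

def averaged_pdf_closed_form(y, x, n0):
    """With h ~ CN(0,1), y given x is CN(0, |x|^2 + n0)."""
    var = np.abs(x) ** 2 + n0
    return np.exp(-np.abs(y) ** 2 / var) / (np.pi * var)

if __name__ == "__main__":
    y, x, n0 = 0.3 + 0.4j, 1.0 + 0.0j, 0.5
    print(averaged_pdf_mc(y, x, n0), averaged_pdf_closed_form(y, x, n0))
```

The two printed values should agree to within the Monte Carlo error, which shrinks as `num_draws` grows.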

Finally, when relying on the model in Fig. 3, the multiple input part of the state vector, $\theta_{\mathrm{MI},i}$, will be unknown even when $\theta_{\mathrm{ch},i}$ is known; averaging it out to obtain $p_{\theta_{\mathrm{ch}}}(y \mid \kappa_i)$ simply means computing marginal probabilities for the coordinate of interest $\kappa_i$. This will be used in Section IV-C, and in producing the numerical results.



C. Derivation of mutual information for CI 

In order to compute the mutual information when transmitting a $q_U$-ary r.v. on a random position in an $N$-antenna label, consider a $q_U$-ary r.v. $\kappa$; the r.v. $S_\ell$, whose outcome determines the switch position, is uniformly distributed over $\{0, \ldots, 2N-1\}$, and has known realizations $S_\ell = i$. For the coherent case, where $\theta$ and the permutation pattern (i.e. $S_\ell$) are perfectly known to the receiver, one must calculate the average mutual information $I(\kappa; y, \theta, S_\ell)$ as an average of $I(\kappa; y, \theta, S_\ell = i)$ over $S_\ell$, i.e.

$$I(\kappa; y, \theta, S_\ell) = \frac{1}{2N} \sum_{i=0}^{2N-1} I(\kappa_i; y, \theta_i, S_\ell = i); \tag{19}$$

''unless some form of interference cancellation is attempted 






as noted, the channel state can be viewed as a known random parameter with (random) realizations $\theta_i$; likewise, $S_\ell$ is a known random parameter. Similarly, in the noncoherent case, one must compute the average mutual information $I(\kappa; y, S_\ell)$ as an average of $I(\kappa; y, S_\ell = i)$ over $S_\ell$, i.e.

$$I(\kappa; y, S_\ell) = \frac{1}{2N} \sum_{i=0}^{2N-1} I(\kappa_i; y, S_\ell = i); \tag{20}$$



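and, in the partially coherent case (when $\theta_{\mathrm{ch}}$ is known but the multiple input component of the state vector, $\theta_{\mathrm{MI}}$, is unknown) the mutual information is $I(\kappa; y, \theta_{\mathrm{ch}}, S_\ell)$ and must be computed as an average of $I(\kappa; y, \theta_{\mathrm{ch}}, S_\ell = i)$ over $S_\ell$, i.e.

$$I(\kappa; y, \theta_{\mathrm{ch}}, S_\ell) = \frac{1}{2N} \sum_{i=0}^{2N-1} I(\kappa_i; y, \theta_{\mathrm{ch}}, S_\ell = i). \tag{21}$$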



The third case is more physically meaningful than the first (which assumes that the receiver always knows the 
symbols transmitted from the competing transmit antennas); it is the third mutual information that will be numerically 
computed in the sequel. 

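Denoting by $\boldsymbol{\kappa} = [\kappa_0, \ldots, \kappa_{2N-1}]^T$ the complete MIMO channel input, the desired mutual information expressions in the above scenarios are

$$I(\boldsymbol{\kappa}; y, \theta, S_\ell) = 2N \, I(\kappa; y, \theta, S_\ell), \tag{22}$$

$$I(\boldsymbol{\kappa}; y, S_\ell) = 2N \, I(\kappa; y, S_\ell), \tag{23}$$

$$I(\boldsymbol{\kappa}; y, \theta_{\mathrm{ch}}, S_\ell) = 2N \, I(\kappa; y, \theta_{\mathrm{ch}}, S_\ell). \tag{24}$$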



In the sequel the subscript $i$ of $\theta_i$ will be omitted for simplicity, but is to be understood in the theoretical, perfectly coherent case.

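Consider first the coherent case, which requires computing $I(\kappa; y, \theta, S_\ell = i)$ via (19). By the chain rule for mutual information,

$$I(\kappa; y, \theta, S_\ell = i) = I(\kappa; S_\ell = i) + I(\kappa; y, \theta \mid S_\ell = i) \overset{(a)}{=} I(\kappa; \theta \mid S_\ell = i) + I(\kappa; y \mid \theta, S_\ell = i) \overset{(b)}{=} I(\kappa; y \mid \theta, S_\ell = i);$$

(a) follows because $I(\kappa; S_\ell = i) = 0$ due to the independence between $\kappa$ and $S_\ell = i$, and similarly (b), since $\kappa$ is independent of $\theta$. It suffices then to average over the conditional pdf $p(\kappa, y, \theta \mid S_\ell = i)$, and, since $I(\kappa; y, \theta \mid S_\ell = i) = I(\kappa; y \mid \theta, S_\ell = i)$, the conditional mutual information of $\kappa$, $y$, given $\theta$ and $S_\ell = i$, for the case when $\kappa$ is uniformly distributed in $\mathcal{Q}_U$, reduces to (see Appendix I)

$$I(\kappa; y \mid \theta, S_\ell = i) = \log_2(q_U) - E_{\kappa, y, \theta}\left[ \log_2 \frac{\sum_{z \in \mathcal{X}} p_\theta(y \mid z)}{\sum_{\zeta \in \mathcal{X}_\kappa^i} p_\theta(y \mid \zeta)} \right], \tag{25}$$

where $\kappa$, $y$, and $\theta$ are conditionally distributed as (see Appendix II)

$$p(\kappa, y, \theta \mid S_\ell = i) = q_U^{-2N} \, p(\theta) \sum_{\zeta \in \mathcal{X}_\kappa^i} p_\theta(y \mid \zeta). \tag{26}$$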



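Above, $\mathcal{X}_\kappa^i$ denotes the set of all $N$-antenna labels having $\kappa$ on position $i$, $i = 0, \ldots, 2N-1$. Finally, using eqs. (19), (22), one obtains the mutual information with CI in the coherent scenario, for the case when $\kappa$ is uniformly distributed in $\mathcal{Q}_U$, to be

$$I(\boldsymbol{\kappa}; y, \theta, S_\ell) = 2N \log_2(q_U) - \sum_{i=0}^{2N-1} E_{\kappa, y, \theta}\left[ \log_2 \frac{\sum_{z \in \mathcal{X}} p_\theta(y \mid z)}{\sum_{\zeta \in \mathcal{X}_\kappa^i} p_\theta(y \mid \zeta)} \right]. \tag{27}$$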



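Likewise, in the case when $\kappa$ is non-uniformly distributed in $\mathcal{Q}_U$,

$$I(\kappa; y \mid \theta, S_\ell = i) = -E_{\kappa, y, \theta}\left[ \log_2 \frac{\sum_{z \in \mathcal{X}} p_\theta(y \mid z) \, p(z)}{\sum_{\zeta \in \mathcal{X}_\kappa^i} p_\theta(y \mid \zeta) \, p(\zeta)} \right], \tag{28}$$

$$p(\kappa, y, \theta \mid S_\ell = i) = p(\kappa) \, p(\theta) \sum_{\zeta \in \mathcal{X}_\kappa^i} p_\theta(y \mid \zeta) \, p(\zeta), \tag{29}$$

$$I(\boldsymbol{\kappa}; y, \theta, S_\ell) = -\sum_{i=0}^{2N-1} E_{\kappa, y, \theta}\left[ \log_2 \frac{\sum_{z \in \mathcal{X}} p_\theta(y \mid z) \, p(z)}{\sum_{\zeta \in \mathcal{X}_\kappa^i} p_\theta(y \mid \zeta) \, p(\zeta)} \right]. \tag{30}$$

Similarly in the noncoherent case, when $\kappa$ is uniformly distributed in $\mathcal{Q}_U$,

$$I(\boldsymbol{\kappa}; y, S_\ell) = 2N \log_2(q_U) - \sum_{i=0}^{2N-1} E_{\kappa, y}\left[ \log_2 \frac{\sum_{z \in \mathcal{X}} p(y \mid z)}{\sum_{\zeta \in \mathcal{X}_\kappa^i} p(y \mid \zeta)} \right], \tag{31}$$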



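where $p(y \mid z)$ is obtained as in Section IV-B; and in the partially coherent case, when the fading coefficients are known to the receiver,

$$I(\boldsymbol{\kappa}; y, \theta_{\mathrm{ch}}, S_\ell) = 2N \log_2(q_U) - \sum_{i=0}^{2N-1} E_{\kappa, y, \theta_{\mathrm{ch}}}\left[ \log_2 \frac{\sum_{z \in \mathcal{X}} p_{\theta_{\mathrm{ch}}}(y \mid z)}{\sum_{\zeta \in \mathcal{X}_\kappa^i} p_{\theta_{\mathrm{ch}}}(y \mid \zeta)} \right], \tag{32}$$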



where averaging over the multiple input component of the channel state vector, per (18), is implemented naturally 
via marginal probabilities. 
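A Monte Carlo evaluation of an expression of the form (32) can be sketched as follows. This is only an illustration under stated assumptions, not the procedure used to produce the figures: the channel is taken to be $y = Hx + n$ with i.i.d. Rayleigh fading $H$ known to the receiver, each antenna symbol is formed from two real coordinates drawn uniformly from a real alphabet `coords`, the SNR is defined as total transmit energy over per-antenna noise variance, and all labels are enumerated exhaustively (feasible only for small $q_U^{2N}$). The function name and its parameters are hypothetical.

```python
import itertools
import numpy as np

def ci_mutual_information(coords, n_tx, n_rx, snr_db, num_samples=2000, seed=0):
    """Estimate 2N*log2(q_U) - sum_i E[log2(sum_z p(y|z) / sum_{zeta in X_kappa^i} p(y|zeta))]."""
    rng = np.random.default_rng(seed)
    coords = np.asarray(coords, dtype=float)
    q_u, n_pos = len(coords), 2 * n_tx
    # Enumerate every label as a row of 2N coordinate indices.
    idx = np.array(list(itertools.product(range(q_u), repeat=n_pos)))
    vals = coords[idx]
    symbols = vals[:, 0::2] + 1j * vals[:, 1::2]        # one complex symbol per antenna
    es = np.mean(np.abs(symbols) ** 2)                  # average energy per antenna symbol
    n0 = n_tx * es / 10 ** (snr_db / 10)                # assumed SNR definition
    penalty = 0.0
    for _ in range(num_samples):
        h = (rng.standard_normal((n_rx, n_tx))
             + 1j * rng.standard_normal((n_rx, n_tx))) / np.sqrt(2)
        tx = rng.integers(len(idx))                     # uniformly chosen transmitted label
        noise = np.sqrt(n0 / 2) * (rng.standard_normal(n_rx)
                                   + 1j * rng.standard_normal(n_rx))
        y = h @ symbols[tx] + noise
        # Likelihood of y under every candidate label (common factors cancel in the ratio).
        loglik = -np.sum(np.abs(y - symbols @ h.T) ** 2, axis=1) / n0
        lik = np.exp(loglik - loglik.max())
        total = lik.sum()
        for i in range(n_pos):
            # Labels sharing the transmitted coordinate on position i; the other
            # coordinates are marginalized out by the sum.
            match = idx[:, i] == idx[tx, i]
            penalty += np.log2(total / lik[match].sum())
    return n_pos * np.log2(q_u) - penalty / num_samples

if __name__ == "__main__":
    for snr_db in (0, 10, 20):
        print(snr_db, ci_mutual_information([-1.0, 1.0], n_tx=2, n_rx=2, snr_db=snr_db))
```

A single draw of the full label, channel, and noise is reused for all $2N$ positions, which is consistent with the joint distribution (26) when all coordinates are uniform and independent.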

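Similar expressions can be derived in the noncoherent and partially coherent scenarios for the case when $\kappa$ is non-uniformly distributed in $\mathcal{Q}_U$; in the latter case

$$I(\boldsymbol{\kappa}; y, \theta_{\mathrm{ch}}, S_\ell) = -\sum_{i=0}^{2N-1} E_{\kappa, y, \theta_{\mathrm{ch}}}\left[ \log_2 \frac{\sum_{z \in \mathcal{X}} p_{\theta_{\mathrm{ch}}}(y \mid z) \, p(z)}{\sum_{\zeta \in \mathcal{X}_\kappa^i} p_{\theta_{\mathrm{ch}}}(y \mid \zeta) \, p(\zeta)} \right]. \tag{33}$$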



V. Numerical results 

For a variety of MIMO configurations and constellation sizes the mutual information in the presence of CI is 
shown in Figs. 4-7, along with mutual information for BICM and CM. While the fading coefficients are assumed 
known to the receiver, the interfering symbols transmitted from other antennas are not known, and are averaged out 
as discussed in Section IV-B (essentially computing marginal likelihood probabilities); mathematically, the figures plot $I(\boldsymbol{\kappa}; y, \theta_{\mathrm{ch}})$.



A. Complex constellations that are invariant to CI 

Complex constellations like 4PSK, 16QAM, 64QAM are invariant to CI; i.e., all possible combinations of valid coordinates of either type (real or imaginary) result in valid constellation points. Mathematically this means that $\mathcal{Q} = (\mathcal{Q}_I \cup \mathcal{Q}_Q) \times (\mathcal{Q}_I \cup \mathcal{Q}_Q)$, as opposed to $\mathcal{Q} \subset (\mathcal{Q}_I \cup \mathcal{Q}_Q) \times (\mathcal{Q}_I \cup \mathcal{Q}_Q)$. One can easily visualize this, and such cases are illustrated in Figs. 4, 5, 6; Fig. 7 includes two single transmit antenna scenarios for comparison.
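The invariance condition can be checked mechanically. The sketch below assumes the usual odd-integer grids for square QAM and a cross-shaped 32QAM obtained by removing the four corners of a 6x6 grid; the helper names are hypothetical. It builds the set of all coordinate pairings and compares it with the original constellation.

```python
import itertools

def square_qam(side):
    """Square QAM on the odd-integer grid, e.g. side=2 -> 4PSK, side=4 -> 16QAM."""
    levels = range(-(side - 1), side, 2)
    return {complex(i, q) for i in levels for q in levels}

def cross_32qam():
    """Cross-shaped 32QAM: 6x6 grid with the four corner points removed."""
    return {z for z in square_qam(6) if not (abs(z.real) == 5 and abs(z.imag) == 5)}

def ci_closure(constellation):
    """All points obtained by pairing any two valid coordinates of either type."""
    coords = {z.real for z in constellation} | {z.imag for z in constellation}
    return {complex(i, q) for i, q in itertools.product(coords, repeat=2)}

def is_ci_invariant(constellation):
    return set(constellation) == ci_closure(constellation)

if __name__ == "__main__":
    print(is_ci_invariant(square_qam(2)), is_ci_invariant(square_qam(4)))
    print(is_ci_invariant(cross_32qam()), len(ci_closure(cross_32qam())))
```

Under these assumptions 4PSK and 16QAM pass the test, while the cross-shaped 32QAM is a strict subset of its 36-point closure.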



B. Complex constellations that grow in size after CI 

An example of a complex constellation that is not invariant with respect to CI is the cross-shaped 32QAM. This constellation does not admit a pure Gray mapping, but there exists an optimum impure Gray coding [23] with a Gray penalty of 7/6 (this should be as close to 1 as possible; pure Gray mapping corresponds to a Gray penalty of 1). More importantly, $\mathcal{Q} \subset (\mathcal{Q}_I \cup \mathcal{Q}_Q) \times (\mathcal{Q}_I \cup \mathcal{Q}_Q)$, and thereby putting together arbitrary coordinates, which are otherwise valid, results in points that are not necessarily present in the original constellation; the richer constellation can send more bits per channel use asymptotically with SNR, which means that the mutual information curve in the presence of CI can cross even the CM curve at high mutual information values; see Fig. 8. The implication is that it is possible to alleviate the capacity limit by CI, at high information levels.

Fig. 4. Mutual information comparison between BICM (with pure Gray mapping) and CM with and without CI, in a Rayleigh fading MIMO configuration with two transmit and one receive antennas, and 16QAM constellations.

It is possible to evaluate exactly the upper mutual information margin for a given, finite, constellation size; the simple observation is that for any two r.v.s $X$, $Y$

$$I(X; Y) = H(X) - H(X \mid Y);$$

note that, asymptotically with SNR (i.e., as the noise becomes negligible), $Y$ resembles $X$ more and more, and replicates $X$ in the limit. Thereby, as $\mathrm{SNR} \to \infty$, $H(X \mid Y) \to H(X \mid X) = 0$ and $I(X; Y) \to H(X)$; this was to be expected, since the maximum mutual information cannot exceed the source's entropy. In this context $X$ will be a r.v. with realizations in $\mathcal{M}$ from (13). A cross-shaped 32QAM constellation $\mathcal{Q}$, when used on each transmit antenna prior to CI, gives rise to a thirty-six point constellation $\mathcal{M}$ after CI (replicated $N$ times when $N$ transmit antennas are used). In the enhanced constellation $\mathcal{M}$ there are four points that occur with probability 1/64, sixteen points with probability 3/128, and sixteen points with probability 9/256. The set of probabilities of complex $N$-tuples corresponding to the $N$ transmit antennas is the $N$-fold Cartesian product of this set of probabilities with itself; alternatively, the entropy of the $N$-antenna labels is $N$ times the entropy corresponding to one antenna (antenna streams are assumed independent). It can be easily verified that for the thirty-six point enhanced constellation the entropy equals 5.12 bits, which after multiplication by $N = 2$ yields more than the 10 bit information limit that bounds both CM and BICM using the original cross-shaped 32QAM constellation; this is indeed consistent with Figs. 8, 9, and explains the crossover between CM and CI.
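The probabilities and the entropy quoted above can be reproduced with a few lines of code. The sketch assumes the cross-shaped 32QAM lies on the odd-integer grid with the four corners removed, that the original points are equiprobable, and that after ideal CI the two coordinates of a symbol are independent draws from the pooled coordinate distribution; the function names are hypothetical.

```python
from collections import Counter
from fractions import Fraction
from math import log2

def cross_32qam():
    """Cross-shaped 32QAM as (I, Q) pairs: 6x6 odd-integer grid minus the corners."""
    levels = (-5, -3, -1, 1, 3, 5)
    return [(i, q) for i in levels for q in levels
            if not (abs(i) == 5 and abs(q) == 5)]

def enhanced_distribution(points):
    """Point probabilities after CI: product of pooled coordinate marginals."""
    counts = Counter()
    for i, q in points:
        counts[i] += 1
        counts[q] += 1
    total = 2 * len(points)
    marginal = {c: Fraction(k, total) for c, k in counts.items()}
    return {(a, b): marginal[a] * marginal[b] for a in marginal for b in marginal}

def entropy_bits(dist):
    return -sum(float(p) * log2(float(p)) for p in dist.values())

if __name__ == "__main__":
    dist = enhanced_distribution(cross_32qam())
    print(len(dist), Counter(dist.values()))
    h = entropy_bits(dist)
    print(h, 2 * h)   # entropy per antenna and for N = 2 antennas
```

With these assumptions the per-antenna entropy is approximately 5.12 bits, so two antennas carry slightly more than 10 bits.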

Fig. 5. Mutual information comparison between BICM (with pure Gray mapping) and CM with and without CI, in a Rayleigh fading MIMO configuration with two transmit and receive antennas, and 16QAM constellations.

A more pronounced constellation enhancement effect can be obtained by constellation rotation; the extent of constellation enrichment after CI will be more dramatic than in the cross-shaped 32QAM case. Rotated constellations may possess an advantage when CI is used, and may thereby be desirable, because CI results in an enriched effective constellation; grouping scrambled coordinates results in complex values that do not necessarily belong to the original constellation, and the richer effective channel alphabet leads to higher mutual information (asymptotically with SNR, see Fig. 8), whose curve even crosses above the CM curve in the range of interest. Fig. 8 is a very limited illustration because constellation enrichment was achieved simply due to the inherent structure of the cross-shaped 32QAM constellation (the increase was marginal, from 32 to 36 points). Deliberate rotations will increase the constellation size to larger extents, and probably cause the CI curve to cross the CM curve earlier. Fig. 10 shows the effect of rotating a 4PSK constellation by an angle that makes the enhanced constellation a scaled version of a 16QAM constellation. It might be possible to engage (activate) CI selectively, at high SNR, in order to increase the throughput while delaying an increase of the constellation size. This aspect deserves further consideration.
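The rotation effect can be verified directly for the angle and scale factor stated in the caption of Fig. 11, namely $\pi/4 - \arctan(1/3)$ and $\sqrt{2}\sin(\arctan(1/3))$. The sketch below is illustrative only; the helper name `ci_enhanced` is hypothetical, and coordinates are divided by the scale factor and rounded to integers purely to make the set comparison exact.

```python
import cmath
import itertools
import math

def ci_enhanced(points, scale=1.0):
    """Points reachable after CI, as integer (I, Q) pairs in units of `scale`."""
    coords = ({round(p.real / scale) for p in points}
              | {round(p.imag / scale) for p in points})
    return set(itertools.product(coords, repeat=2))

qpsk = [1 + 1j, -1 + 1j, -1 - 1j, 1 - 1j]            # 4PSK constellation {+-1 +- j}
phi = math.pi / 4 - math.atan(1 / 3)                 # rotation angle from Fig. 11
scale = math.sqrt(2) * math.sin(math.atan(1 / 3))    # scale factor from Fig. 11
rotated = [p * cmath.exp(1j * phi) for p in qpsk]

if __name__ == "__main__":
    print(sorted(ci_enhanced(qpsk)))                  # 4 points: no enrichment
    print(sorted(ci_enhanced(rotated, scale)))        # 16 points on the {+-1, +-3} grid
```

Without rotation, CI leaves the 4PSK alphabet unchanged; with the rotation, every coordinate takes one of four values and the pairings fill the 16QAM grid.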

VI. Conclusions 

This paper identifies a shortcoming of bit interleaved coded modulation in MIMO fading channels, in terms of 
constrained i.i.d. mutual information relative to coded modulation; while the mutual information curves for coded 
modulation and bit interleaved coded modulation overlap in the case of a single input single output channel, a loss 
in mutual information was observed for bit interleaved coded modulation in MIMO channels. An alternative to 
bit interleaved coded modulation in MIMO channels, namely coordinate interleaving, was proposed and analyzed. 
Coordinate interleaving offers better diversity, and improves constrained mutual information over the bit interleaved 
coded modulation. Effects like constellation rotation and non-uniform constellation points in the presence of CI are 
also studied. 

Appendix I 

Fig. 6. Mutual information comparison between BICM (with pure Gray mapping) and CM with and without CI, in a Rayleigh fading MIMO configuration with two transmit and receive antennas, and 64QAM constellations.

The derivation of (25) is as follows. First, note that referring to a $q_U$-ary r.v. $\kappa$ while conditioning on $S_\ell = i$ is equivalent to referring to $\kappa_i$. By definition, if the realizations of a r.v. $A$ are from $\mathcal{A}$, with cardinality $|\mathcal{A}|$, then the mutual information between r.v.s $A$, $B$ is

$$I(A; B) = E_{A,B}\left[ \log_2 \frac{p(a, b)}{p(a) \, p(b)} \right] = -E_{A,B}\left[ \log_2 \frac{p(b)}{p(b \mid a)} \right] = -E_{A,B}\left[ \log_2 \frac{\sum_{a' \in \mathcal{A}} p(b \mid a') \, p(a')}{p(b \mid a)} \right]. \tag{34}$$

If $A$ is uniformly distributed in $\mathcal{A}$ then $p(a) = |\mathcal{A}|^{-1}$, $\forall a \in \mathcal{A}$, which yields

$$I(A; B) = -\log_2 |\mathcal{A}|^{-1} - E_{A,B}\left[ \log_2 \frac{\sum_{a' \in \mathcal{A}} p(b \mid a')}{p(b \mid a)} \right]. \tag{35}$$
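As a sanity check of the uniform-input form (35), the expression can be evaluated for a small discrete channel and compared with a known result. The sketch below uses a binary symmetric channel with crossover probability $\epsilon$, for which the mutual information with uniform input is $1 - H_b(\epsilon)$; the channel choice and the function name are illustrative assumptions.

```python
from math import log2

def mi_uniform_input(cond):
    """Evaluate (35) for a discrete channel; cond[a][b] = p(b|a), input uniform."""
    n_in = len(cond)
    expectation = 0.0
    for a in range(n_in):
        for b in range(len(cond[a])):
            p_ba = cond[a][b]
            if p_ba == 0.0:
                continue                       # zero-probability pairs contribute nothing
            col_sum = sum(cond[a2][b] for a2 in range(n_in))
            expectation += (p_ba / n_in) * log2(col_sum / p_ba)
    return log2(n_in) - expectation

def bsc_capacity(eps):
    """1 - H_b(eps) for 0 < eps < 1."""
    return 1.0 + eps * log2(eps) + (1.0 - eps) * log2(1.0 - eps)

if __name__ == "__main__":
    eps = 0.1
    bsc = [[1 - eps, eps], [eps, 1 - eps]]
    print(mi_uniform_input(bsc), bsc_capacity(eps))
```

For a noiseless channel the expectation term vanishes and the expression reduces to $\log_2 |\mathcal{A}|$, the entropy of the uniform input.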



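Similarly, if the realizations of a r.v. $A$ are uniformly distributed in $\mathcal{A}$, with $|\mathcal{A}| = 2^m$, then the conditional mutual information between r.v.s $A$, $B$, conditioned on $C$, is

$$I(A; B \mid C) = -\log_2 |\mathcal{A}|^{-1} - E_{A,B,C}\left[ \log_2 \frac{\sum_{a' \in \mathcal{A}} p(b \mid a', c)}{p(b \mid a, c)} \right] = -\log_2 |\mathcal{A}|^{-1} - E_{A,B,C}\left[ \log_2 \frac{\sum_{a' \in \mathcal{A}} p_c(b \mid a')}{p_c(b \mid a)} \right],$$

where conditioning on $c$ has been represented by writing $c$ as a subscript. Then, in the case when $\kappa$ is uniformly distributed in $\mathcal{Q}_U$,

$$I(\kappa; y \mid \theta, S_\ell = i) = \log_2(q_U) - E_{\kappa, y, \theta}\left[ \log_2 \frac{\sum_{a \in \mathcal{Q}_U} p_\theta(y \mid \kappa = a, S_\ell = i)}{p_\theta(y \mid \kappa, S_\ell = i)} \right]. \tag{36}$$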
Fig. 7. Mutual information comparison between BICM (with pure Gray mapping) and CM with and without CI, in a Rayleigh fading MIMO configuration with either one or two transmit, and receive antennas, and 16QAM, 64QAM constellations. Interestingly, while the 2x2 BICM curve crosses below the 1x2 curves at low SNR, the 2x2 CI curve is always above the 1x2 curves.



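Fig. 8. Mutual information comparison between BICM and CM with and without CI, in a Rayleigh fading MIMO configuration with two transmit and receive antennas, and 32QAM constellations.

But by denoting $\kappa_{\bar{\imath}} = \{\kappa_0, \ldots, \kappa_{i-1}, \kappa_{i+1}, \ldots, \kappa_{2N-1}\}$ one can write

$$p_\theta(y \mid \kappa, S_\ell = i) = p_\theta(y \mid \kappa_i = \kappa) = \sum_{\kappa_{\bar{\imath}}} p_\theta(y \mid \kappa_i = \kappa, \kappa_{\bar{\imath}}) \, p(\kappa_{\bar{\imath}}) = q_U^{-(2N-1)} \sum_{\zeta \in \mathcal{X}_\kappa^i} p_\theta(y \mid \zeta)$$

and

$$\sum_{a \in \mathcal{Q}_U} p_\theta(y \mid \kappa_i = a) = q_U^{-(2N-1)} \sum_{z \in \mathcal{X}} p_\theta(y \mid z),$$

so that

$$I(\kappa; y \mid \theta, S_\ell = i) = \log_2(q_U) - E_{\kappa, y, \theta}\left[ \log_2 \frac{q_U^{-(2N-1)} \sum_{z \in \mathcal{X}} p_\theta(y \mid z)}{q_U^{-(2N-1)} \sum_{\zeta \in \mathcal{X}_\kappa^i} p_\theta(y \mid \zeta)} \right];$$

hence (25) follows after simplifying $q_U^{-(2N-1)}$. Likewise, in the case when $\kappa$ is non-uniformly distributed in $\mathcal{Q}_U$,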

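$$I(\kappa; y \mid \theta, S_\ell = i) = -E_{\kappa, y, \theta}\left[ \log_2 \frac{\sum_{a \in \mathcal{Q}_U} p_\theta(y \mid \kappa = a, S_\ell = i) \, p(a)}{p_\theta(y \mid \kappa, S_\ell = i)} \right]. \tag{37}$$

By denoting $\kappa_{\bar{\imath}} = \{\kappa_0, \ldots, \kappa_{i-1}, \kappa_{i+1}, \ldots, \kappa_{2N-1}\}$ one can write

$$p_\theta(y \mid \kappa, S_\ell = i) = p_\theta(y \mid \kappa_i = \kappa) = \sum_{\kappa_{\bar{\imath}}} p_\theta(y \mid \kappa_i = \kappa, \kappa_{\bar{\imath}}) \, p(\kappa_{\bar{\imath}})$$

and

$$\sum_{a \in \mathcal{Q}_U} p_\theta(y \mid \kappa = a, S_\ell = i) \, p(a) = \sum_{a \in \mathcal{Q}_U} \sum_{\kappa_{\bar{\imath}}} p_\theta(y \mid \kappa_i = a, \kappa_{\bar{\imath}}) \, p(\kappa_{\bar{\imath}}) \, p(a) = \sum_{z \in \mathcal{X}} p_\theta(y \mid z) \, p(z),$$

hence (28) follows.

Fig. 9. Mutual information comparison between BICM and CM with and without CI, in a Rayleigh fading MIMO configuration with two transmit and one receive antennas, and 32QAM constellations.

Fig. 10. Mutual information comparison between BICM and CM with and without CI, in a Rayleigh fading MIMO configuration with two transmit and receive antennas, using a 4QAM constellation with and without rotation.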
Fig. 11. When rotated by an angle of $\pi/4 - \arctan(1/3)$ and coordinate interleaved, a 4PSK constellation $\{\pm 1 \pm j\}$ gives rise to a 16QAM constellation scaled by $\sqrt{2} \sin(\arctan(1/3))$.



Appendix II 

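The derivation of (26) is as follows. Note that $\kappa$, $\theta$ do not depend on $S_\ell$; then

$$p(\kappa, y, \theta \mid S_\ell = i) = p(y \mid \kappa, \theta, S_\ell = i) \, p(\kappa, \theta \mid S_\ell = i)$$
$$= p_\theta(y \mid \kappa_i = \kappa) \, p(\kappa, \theta \mid S_\ell = i)$$
$$= q_U^{-1} \, p(\theta) \sum_{\kappa_{\bar{\imath}}} p_\theta(y \mid \kappa_{\bar{\imath}}, \kappa_i = \kappa) \, q_U^{-(2N-1)}$$
$$= q_U^{-2N} \, p(\theta) \sum_{\zeta \in \mathcal{X}_\kappa^i} p_\theta(y \mid \zeta).$$

Finally, when $\kappa$ is non-uniformly distributed in $\mathcal{Q}_U$, (29) is proved via

$$p(\kappa, y, \theta \mid S_\ell = i) = p(y \mid \kappa, \theta, S_\ell = i) \, p(\kappa, \theta \mid S_\ell = i)$$
$$= p_\theta(y \mid \kappa_i = \kappa) \, p(\kappa, \theta \mid S_\ell = i)$$
$$= p_\theta(y \mid \kappa_i = \kappa) \, p(\kappa) \, p(\theta)$$
$$= p(\theta) \sum_{\kappa_{\bar{\imath}}} p_\theta(y, \kappa_{\bar{\imath}} \mid \kappa_i = \kappa) \, p(\kappa)$$
$$= p(\kappa) \, p(\theta) \sum_{\kappa_{\bar{\imath}}} p_\theta(y \mid \kappa_{\bar{\imath}}, \kappa_i = \kappa) \, p(\kappa_{\bar{\imath}})$$
$$= p(\kappa) \, p(\theta) \sum_{\zeta \in \mathcal{X}_\kappa^i} p_\theta(y \mid \zeta) \, p(\zeta).$$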






References 

[1] S. M. Alamouti, "A simple transmit diversity technique for wireless communications," IEEE J. Select. Areas Commun., vol. 16, pp. 1451-1458, Oct. 1998.
[2] V. Tarokh, N. Seshadri, and A. R. Calderbank, "Space-time codes for high data rate wireless communication: Performance criteria and code construction," IEEE Trans. Inform. Theory, vol. 44, no. 2, pp. 744-765, March 1998.
[3] V. Tarokh, H. Jafarkhani, and A. R. Calderbank, "Space-time block codes from orthogonal designs," IEEE Trans. Inform. Theory, vol. 45, no. 5, pp. 1456-1467, July 1999.
[4] H. Schulze, "Geometrical Properties of Orthogonal Space-Time Codes," IEEE Commun. Lett., vol. 7, pp. 64-66, Jan. 2003.
[5] Z. Yan and D. M. Ionescu, "Geometrical Uniformity of a Class of Space-Time Trellis Codes," IEEE Trans. Inform. Theory, vol. 50, pp. 3343-3347, Dec. 2004.
[6] O. Tirkkonen and A. Hottinen, "Square-matrix embeddable space-time block codes for complex signal constellations," IEEE Trans. Inform. Theory, vol. 48, pp. 384-395, Feb. 2002.
[7] D. M. Ionescu, K. K. Mukkavilli, Z. Yan, and J. Lilleberg, "Improved 8- and 16-State Space-Time codes for 4PSK with Two Transmit Antennas," IEEE Commun. Lett., vol. 5, pp. 301-303, July 2001.
[8] N. Seshadri and H. Jafarkhani, "Super-Orthogonal Space-Time Trellis Codes," Proc. IEEE Int. Conf. on Commun., vol. 3, pp. 1439-1443, 28 Apr.-2 May 2002.
[9] H. Jafarkhani and N. Seshadri, "Super-orthogonal space-time trellis codes," IEEE Trans. Inform. Theory, vol. 49, pp. 937-950, Apr. 2003.
[10] S. Siwamogsatham and M. P. Fitz, "Improved High-Rate Space-Time Codes via Concatenation of Expanded Orthogonal Block Code and M-TCM," Proc. IEEE Int. Conf. on Commun., vol. 1, pp. 636-640, 28 Apr.-2 May 2002.
[11] S. Siwamogsatham and M. P. Fitz, "Improved High-Rate Space-Time Codes via Orthogonality and Set Partitioning," Proc. Wireless Commun. and Networking Conf., pp. IM-llQ, Sept. 2002.
[12] E. Biglieri, G. Taricco, A. Tulino, "Performance of space-time codes for a large number of antennas," IEEE Trans. Inform. Theory, vol. 48, pp. 1794-1803, July 2002.
[13] G. Caire, G. Taricco, and E. Biglieri, "Bit interleaved coded modulation," IEEE Trans. Inform. Theory, vol. 44, no. 3, pp. 927-946, May 1998.
[14] D. Arnold, H.-A. Loeliger, "On the information rate of binary-input channels with memory," in Proc. IEEE Int. Conf. Commun., June 2001, pp. 2692-2695.
[15] H.-F. Lu, Y. Wang, P. V. Kumar, and K. M. Chugg, "Remarks on Space-Time Codes Including a New Lower Bound and an Improved Code," IEEE Trans. Inform. Theory, vol. 49, no. 10, pp. 2752-2757, October 2003.
[16] D. M. Ionescu, "On Space-Time Code Design," IEEE Trans. Wireless Commun., vol. 2, pp. 20-28, Jan. 2003.
[17] 3GPP2, WG5, "1xEV-DV evaluation methodology - addendum (V6)," Contribution to the Joint 3rd Generation Partnership Program II, July 2001.
[18] B. D. Jelicic and S. Roy, "Design of Trellis Coded QAM for Flat Fading and AWGN Channels," IEEE Trans. Veh. Technol., pp. 192-201, Feb. 1995.
[19] I. Furio and G. Femenias, "Rotated TCM systems with dual transmit and multiple receive antennas on Nakagami fading channels," IEEE Trans. Commun., vol. 50, Oct. 2002.
[20] A. M. Tonello, "Space-time bit interleaved coded modulation with an iterative decoding strategy," Proc. Veh. Technol. Conf., Sept. 2000, pp. 473-478.
[21] S. Benedetto, D. Divsalar, G. Montorsi, and F. Pollara, "A soft-input soft-output APP module for iterative decoding of concatenated codes," IEEE Commun. Lett., vol. 1, pp. 22-24, January 1997.
[22] G. Ungerboeck, "Channel coding with multilevel/phase signals," IEEE Trans. Inform. Theory, vol. 28, pp. 55-69, Jan. 1982.
[23] J. G. Smith, "Odd-bit quadrature amplitude shift keying," IEEE Trans. Commun., vol. 23, pp. 385-389, Mar. 1975.
[24] J. J. Boutros, F. Boixadera, and C. Lamy, "Bit-interleaved coded modulations for multiple-input multiple-output channels," Proc. IEEE 6th Int. Symp. on Spread-Spectrum Tech. & Appli., Sept. 6-8, 2000, pp. 123-126.
[25] A. O. Berthet, R. Visoz, and J. J. Boutros, "Space-time BICM versus space-time trellis code for MIMO block fading multipath AWGN channel," Proc. Inform. Theory Workshop, Mar. 31-Apr. 4, 2003, pp. 206-209.
[26] J. Boutros and E. Viterbo, "Signal space diversity: a power- and bandwidth-efficient diversity technique for the Rayleigh fading channel," IEEE Trans. Inform. Theory, vol. 44, no. 4, pp. 1453-1467, July 1998.
[27] D. M. Ionescu and Z. Yan, "Full-dimensional lattice space-time codes for parallel concatenation," submitted for publication.



